The RATS Collection: Supporting HLT Research with Degraded Audio Data

Authors

  • David Graff
  • Kevin Walker
  • Stephanie Strassel
  • Xiaoyi Ma
  • Karen Jones
  • Ann Sawyer
Abstract

The DARPA RATS program was established to foster development of language technology systems that can perform well on speaker-to-speaker communications over radio channels that evince a wide range in the type and extent of signal variability and acoustic degradation. Creating suitable corpora to address this need poses an equally wide range of challenges for the collection, annotation and quality assessment of relevant data. This paper describes the LDC’s multi-year effort to build the RATS data collection, summarizes the content and properties of the resulting corpora, and discusses the novel problems and approaches involved in ensuring that the data would satisfy its intended use: providing speech recordings and annotations for training and evaluating HLT systems that perform four specific tasks on difficult radio channels, namely Speech Activity Detection (SAD), Language Identification (LID), Speaker Identification (SID) and Keyword Spotting (KWS).

Related resources

TRAP language identification system for RATS phase II evaluation

Automatic language identification or detection of audio data has become an important preprocessing step for speech/speaker recognition and audio data mining. In many surveillance applications, language detection has to be performed on highly degraded audio inputs. In this paper, we present our work on language detection in highly degraded radio channel scenarios. We provide a brief description ...

The RATS radio traffic collection system

The DARPA RATS Program focuses on the development of new technologies for identifying and processing speaker-to-speaker communications over degraded radio channels. In order to build a corpus to address this research question, we developed a system that takes a clean source signal and transmits it over eight different radio channels, where the variation from channel to channel results in a range...

Tools for Collecting Speech Corpora via Mechanical-Turk

To rapidly port speech applications to new languages, one of the most difficult tasks is the initial collection of sufficient speech corpora. State-of-the-art automatic speech recognition systems are typically trained on hundreds of hours of speech data. While pre-existing corpora do exist for major languages, a sufficient amount of quality speech data is not available for most world languages. Wh...

The South African Human Language Technology Audit

Human language technology (HLT) has been identified as a priority area by the South African government. However, despite efforts by government and the research and development (R&D) community, South Africa has not yet been able to maximise the opportunities of HLT and create a thriving HLT industry. One of the key challenges is the fact that there is insufficient codified knowledge about the cu...

The Rights of Producers of Audio and Visual Media Tools

Media tools include phonograms and videograms, which do not create any work themselves but assist in recording it. Phonogram and videogram producers therefore play an important role in fixing a work. The rights of producers of audio-visual media tools are part of the Related Rights. However, most legislation only deals with supporting producers of audio media tools, and the producers of v...


Publication date: 2014